Sep 9, 2008 3:57pm
Number of Pages in Site?
Hi ... confused here again, but different topic.
So all these links we see point to the top page, ie. home page of a specific URL ... right? Well, the site I'm
looking at contains hundreds of internal pages. For instance, http://web.archive.org/web/*/http://www.dieoff.org
contains hundreds of pages. Here is the Dec 28, 2006 crawl: http://web.archive.org/web/20061228040340/http://dieoff.org/
and here is a page linked to from above page, apparently crawled about 3 months earlier on Sep 27, 2006:http://web.archive.org/web/20060927090516/dieoff.org/page193.htm
So far, so good. Now, lets say this site has 100 internal pages and 2 weeks later the webmaster makes no changes
(or minor ones) to the home page, but either adds or deletes 80 internal pages to/from the site. As I understand it, on the next crawl, we will see the asterisk (showing site was updated), we will see the identical or slightly updated home page, but will get no indication at all that the overall site has undergone such massive changes. Is that correct ... is this how Wayback currently works?
If I do understand correctly, then there is apparently no way for me to easily track the overall size of my example site (dieoff.org) over the 11 yrs of its existence. I would like to suggest that perhaps Wayback could display number of pages, number of total distinct files (including gif/jpg), number of megabytes, to help us spot the major updates/changes over time.
If I'm correct here, how would I make this suggestion? Do I just email email@example.com?